Querying incomplete data is an important task both in data management, and in many AI applications that use query rewriting to take advantage of relational database technology. Usually one looks for answers that are certain, i.e., true in every possible world represented by an incomplete database. For positive queries - expressed either in positive relational algebra or as unions of conjunctive queries - finding such answers can be done efficiently when databases and query answers are sets. Real-life databases however use bag, rather than set, semantics. For bags, instead of saying that a tuple is certainly in the answer, we have more detailed information: namely, the range of the numbers of occurrences of the tuple in query answers. We show that the behavior of positive queries is different under bag semantics: finding the minimum number of occurrences can still be done efficiently, but for maximum it becomes intractable. We use these results to investigate approximation schemes for computing certain answers to arbitrary first-order queries that have been proposed for set semantics. One of them cannot be adapted to bags, as it relies on the intractable maxima of occurrences, but another scheme only deals with minima, and we show how to adapt it to bag semantics without losing efficiency.

On querying incomplete information in databases under bag semantics / Console, M.; Guagliardo, P.; Libkin, L.. - In: IJCAI. - ISSN 1045-0823. - 0:(2017), pp. 993-999. (Intervento presentato al convegno 26th International Joint Conference on Artificial Intelligence, IJCAI 2017 tenutosi a Melbourne) [10.24963/ijcai.2017/138].

On querying incomplete information in databases under bag semantics

Console M.
;
Guagliardo P.;Libkin L.
2017

Abstract

Querying incomplete data is an important task both in data management, and in many AI applications that use query rewriting to take advantage of relational database technology. Usually one looks for answers that are certain, i.e., true in every possible world represented by an incomplete database. For positive queries - expressed either in positive relational algebra or as unions of conjunctive queries - finding such answers can be done efficiently when databases and query answers are sets. Real-life databases however use bag, rather than set, semantics. For bags, instead of saying that a tuple is certainly in the answer, we have more detailed information: namely, the range of the numbers of occurrences of the tuple in query answers. We show that the behavior of positive queries is different under bag semantics: finding the minimum number of occurrences can still be done efficiently, but for maximum it becomes intractable. We use these results to investigate approximation schemes for computing certain answers to arbitrary first-order queries that have been proposed for set semantics. One of them cannot be adapted to bags, as it relies on the intractable maxima of occurrences, but another scheme only deals with minima, and we show how to adapt it to bag semantics without losing efficiency.
2017
26th International Joint Conference on Artificial Intelligence, IJCAI 2017
Incomplete Data; Bag Semantics
04 Pubblicazione in atti di convegno::04c Atto di convegno in rivista
On querying incomplete information in databases under bag semantics / Console, M.; Guagliardo, P.; Libkin, L.. - In: IJCAI. - ISSN 1045-0823. - 0:(2017), pp. 993-999. (Intervento presentato al convegno 26th International Joint Conference on Artificial Intelligence, IJCAI 2017 tenutosi a Melbourne) [10.24963/ijcai.2017/138].
File allegati a questo prodotto
File Dimensione Formato  
Console_On-Querying_2017.pdf

accesso aperto

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Creative commons
Dimensione 136.43 kB
Formato Adobe PDF
136.43 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1561276
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 18
  • ???jsp.display-item.citation.isi??? 12
social impact